Substitute Specification 

10/566606 
^ IAP9 Rec'dPCTffTO 3 1 J AH 20011 

SEQUENCE LISTING 



10 



15 



25 



30 



<110> Degussa AG 

<120> Process for the preparation of L-threonine 

<130> 030217BT 

<160> 10 

<170> Patentin version 3.1 



20 <210> 1 

<211> 993 

<212> DNA 

<213> Escherichia coli 



<220> 

<221> CDS 

<222> (1) . . (990) 

<223> rpoS gene 



<400> 1 

atg agt cag aat acg ctg aaa gtt cat gat tta aat gaa gat gcg gaa 48 
Met Ser Gin Asn Thr Leu Lys Val His Asp Leu Asn Glu Asp Ala Glu 
35 1 5 10 15 

ttt gat gag aac gga gtt gag gtt ttt gac gaa aag gcc tta gta gaa 96 
Phe Asp Glu Asn Gly Val Glu Val Phe Asp Glu Lys Ala Leu Val Glu 
20 25 30 

40 

cag gaa ccc agt gat aac gat ttg gcc gaa gag gaa ctg tta teg cag 144 
Gin Glu Pro Ser Asp Asn Asp Leu Ala Glu Glu Glu Leu Leu Ser Gin 
35 40 45 

45 gga gcc aca cag cgt gtg ttg gac gcg act cag ctt tac ctt ggt gag 192 
Gly Ala Thr Gin Arg Val Leu Asp Ala Thr Gin Leu Tyr Leu Gly Glu 
50 55 60 

att ggt tat tea cca ctg tta acg gcc gaa gaa gaa gtt tat ttt gcg 240 
50 lie Gly Tyr Ser Pro Leu Leu Thr Ala Glu Glu Glu Val Tyr Phe Ala 
65 70 75 80 

cgt cgc gca ctg cgt gga gat gtc gcc tct cgc cgc egg atg ate gag 288 
Arg Arg Ala Leu Arg Gly Asp Val Ala Ser Arg Arg Arg Met lie Glu 
55 85 90 95 

agt aac ttg cgt ctg gtg gta aaa att gcc cgc cgt tat ggc aat cgt 336 
Ser Asn Leu Arg Leu Val Val Lys lie Ala Arg Arg Tyr Gly Asn Arg 
100 105 110 

60 



Substitute 



2 



ggt ctg gcg ttg ctg gac ctt ate gaa gag ggc aac ctg ggg ctg ate 
Gly Leu Ala Leu Leu Asp Leu lie Glu Glu Gly Asn Leu Gly Leu lie 
115 120 125 

cgc gcg gta gag aag ttt gac ccg gaa cgt ggt ttc cgc ttc tea aca 
Arg Ala Val Glu Lys Phe Asp Pro Glu Arg Gly Phe Arg Phe Ser Thr 
130 135 140 

tac gca aec tgg tgg att ege eag aeg att gaa egg geg att atg aae 
Tyr Ala Thr Trp Trp lie Arg Gin Thr lie Glu Arg Ala lie Met Asn 
145 150 155 160 

caa ace cgt act att cgt ttg ccg att cae ate gta aag gag ctg aac 
Gin Thr Arg Thr lie Arg Leu Pro lie His lie Val Lys Glu Leu Asn 
165 170 175 

gtt tac ctg ega aee gea cgt gag ttg tec cat aag ctg gae cat gaa 
Val Tyr Leu Arg Thr Ala Arg Glu Leu Ser His Lys Leu Asp His Glu 
180 185 190 

cca agt gcg gaa gag ate gca gag caa ctg gat aag cca gtt gat gac 
Pro Ser Ala Glu Glu lie Ala Glu Gin Leu Asp Lys Pro Val Asp Asp 
195 200 205 

gtc age cgt atg ctt cgt ett aac gag ege att ace teg gta gae aec 
Val Ser Arg Met Leu Arg Leu Asn Glu Arg lie Thr Ser Val Asp Thr 
210 215 220 

ccg ctg ggt ggt gat tec gaa aaa gcg ttg ctg gac ate ctg gee gat 
Pro Leu Gly Gly Asp Ser Glu Lys Ala Leu Leu Asp lie Leu Ala Asp 
225 230 235 240 

gaa aaa gag aae ggt ccg gaa gat ace aeg caa gat gae gat atg aag 
Glu Lys Glu Asn Gly Pro Glu Asp Thr Thr Gin Asp Asp Asp Met Lys 
245 250 255 

cag age ate gtc aaa tgg ctg ttc gag ctg aac gcc aaa cag cgt gaa 
Gin Ser lie Val Lys Trp Leu Phe Glu Leu Asn Ala Lys Gin Arg Glu 
260 265 270 

gtg ctg gca cgt ega ttc ggt ttg ctg ggg tac gaa geg gea aea ctg 
Val Leu Ala Arg Arg Phe Gly Leu Leu Gly Tyr Glu Ala Ala Thr Leu 
275 280 285 

gaa gat gta ggt cgt gaa att ggc etc ace cgt gaa cgt gtt cgc cag 
Glu Asp Val Gly Arg Glu lie Gly Leu Thr Arg Glu Arg Val Arg Gin 
290 295 300 

att cag gtt gaa ggc ctg cgc cgt ttg cgc gaa ate ctg caa aeg eag 
lie Gin Val Glu Gly Leu Arg Arg Leu Arg Glu lie Leu Gin Thr Gin 
305 310 315 320 

ggg ctg aat ate gaa gcg ctg ttc cgc gag taa 
Gly Leu Asn lie Glu Ala Leu Phe Arg Glu 
325 330 

<210> 2 
<211> 330 
<212> PRT 

<213> Escherichia coli 



^ Substitute Specification 

3 



10 



25 



40 



55 



<400> 2 

Met Ser Gin Asn Thr Leu Lys Val His Asp Leu Asn Glu Asp Ala Glu 
15 10 15 

Phe Asp Glu Asn Gly Val Glu Val Phe Asp Glu Lys Ala Leu Val Glu 
20 25 30 

Gin Glu Pro Ser Asp Asn Asp Leu Ala Glu Glu Glu Leu Leu Ser Gin 
35 40 45 

Gly Ala Thr Gin Arg Val Leu Asp Ala Thr Gin Leu Tyr Leu Gly Glu 
50 55 60 



lie Gly Tyr Ser Pro Leu Leu Thr Ala Glu Glu Glu Val Tyr Phe Ala 
15 65 70 75 80 

Arg Arg Ala Leu Arg Gly Asp Val Ala Ser Arg Arg Arg Met lie Glu 
85 90 95 

2 0 Ser Asn Leu Arg Leu Val Val Lys lie Ala Arg Arg Tyr Gly Asn Arg 
100 105 110 



Gly Leu Ala Leu Leu Asp Leu lie Glu Glu Gly Asn Leu Gly Leu lie 

115 120 125 

Arg Ala Val Glu Lys Phe Asp Pro Glu Arg Gly Phe Arg Phe Ser Thr 
130 135 140 



Tyr Ala Thr Trp Trp lie Arg Gin Thr lie Glu Arg Ala lie Met Asn 
30 145 150 155 160 

Gin Thr Arg Thr lie Arg Leu Pro lie His lie Val Lys Glu Leu Asn 
165 170 175 

35 Val Tyr Leu Arg Thr Ala Arg Glu Leu Ser His Lys Leu Asp His Glu 
180 185 190 



Pro Ser Ala Glu Glu lie Ala Glu Gin Leu Asp Lys Pro Val Asp Asp 
195 200 205 

Val Ser Arg Met Leu Arg Leu Asn Glu Arg lie Thr Ser Val Asp Thr 
210 215 220 



Pro Leu Gly Gly Asp Ser Glu Lys Ala Leu Leu Asp lie Leu Ala Asp 
45 225 230 235 240 

Glu Lys Glu Asn Gly Pro Glu Asp Thr Thr Gin Asp Asp Asp Met Lys 
245 250 255 

50 Gin Ser lie Val Lys Trp Leu Phe Glu Leu Asn Ala Lys Gin Arg Glu 
260 265 270 



Val Leu Ala Arg Arg Phe Gly Leu Leu Gly Tyr Glu Ala Ala Thr Leu 

275 280 285 

Glu Asp Val Gly Arg Glu lie Gly Leu Thr Arg Glu Arg Val Arg Gin 
290 295 300 



60 



lie Gin Val Glu Gly Leu Arg Arg Leu Arg Glu lie Leu Gin Thr Gin 
305 310 315 320 



Substitute Speci£ication 



Gly Leu Asn lie Glu Ala Leu Phe Arg Glu 
325 330 

<210> 3 
<211> 993 
<212> DNA 

<213> Escherichia coli 



10 <220> 

<221> Allele 

<222> (1) . . (990) 

<223> rpoS allele 



15 



20 



<220> 

<221> misc_f eature 

<222> (97),. (99) 

<223> amber codon 



<400> 





atgagtcaga 


atacgctgaa 


agttcatgat 


ttaaatgaag 


atgcggaatt 


tgatgagaac 


60 


ZD 


ggagttgagg 


tttttgacga 


aaaggcctta 


gtagaatagg 


aacccagtga 


taacgatttg 


120 




gccgaagagg 


aactgttatc 


gcagggagcc 


acacagcgtg 


tgttggacgc 


gactcagctt 


180 


30 


taccttggtg 


agattggtta 


ttcaccactg 


ttaacggccg 


aagaagaagt 


ttattttgcg 


240 


cgtcgcgcac 


tgcgtggaga 


tgtcgcctct 


cgccgccgga 


tgatcgagag 


taacttgcgt 


300 




ctggtggtaa 


aaattgcccg 


ccgttatggc 


aatcgtggtc 


tggcgttgct 


ggaccttatc 


360 


35 


gaagagggca 


acctggggct 


gatccgcgcg 


gtagagaagt 


ttgacccgga 


acgtggtttc 


420 




cgcttctcaa 


catacgcaac 


ctggtggatt 


cgccagacga 


ttgaacgggc 


gattatgaac 


480 


40 


caaacccgta 


ctattcgttt 


gccgattcac 


atcgtaaagg 


agctgaacgt 


ttacctgcga 


540 


accgcacgtg 


agttgtccca 


taagctggac 


catgaaccaa 


gtgcggaaga 


gatcgcagag 


600 




caactggata 


agccagttga 


tgacgtcagc 


cgtatgcttc 


gtcttaacga 


gcgcattacc 


660 


45 


tcggtagaca 


ccccgctggg 


tggtgattcc 


gaaaaagcgt 


tgctggacat 


cctggccgat 


720 




gaaaaagaga 


acggtccgga 


agataccacg 


caagatgacg 


atatgaagca 


gagcatcgtc 


780 


50 


aaatggctgt 


tcgagctgaa 


cgccaaacag 


cgtgaagtgc 


tggcacgtcg 


attcggtttg 


840 


ctggggtacg 


aagcggcaac 


actggaagat 


gtaggtcgtg 


aaattggcct 


cacccgtgaa 


900 




cgtgttcgcc 


agattcaggt 


tgaaggcctg 


cgccgtttgc 


gcgaaatcct 


gcaaacgcag 


960 


55 


gggctgaata 


tcgaagcgct 


gttccgcgag 


taa 






993 



<210> 4 

<211> 75 

<212> DNA 

60 <213> Escherichia coli 



Substitute Specification 



5 



<220> 

<221> tRNA 

<222> (1) . . (75) 

<223> supE allele 



<400> 4 

tggggtatcg ccaagcggta aggcaccgga ttctaattcc ggcattccga ggttcgaatc 60 

ctcgtacccc agcca 75 

<210> 5 

<211> 1545 

<212> DNA 

<213> Escherichia coli 



<220> 

<221> CDS 

<222> (1) . . (1542) 

<223> ilvA gene 

<400> 5 

atg get gac teg eaa ccc ctg tec ggt get ccg gaa ggt gee gaa tat 48 
Met Ala Asp Ser Gin Pro Leu Ser Gly Ala Pro Glu Gly Ala Glu Tyr 
15 10 15 

tta aga gca gtg ctg cgc gcg ccg gtt tac gag gcg gcg cag gtt acg 96 
Leu Arg Ala Val Leu Arg Ala Pro Val Tyr Glu Ala Ala Gin Val Thr 
20 25 30 

ccg eta eaa aaa atg gaa aaa ctg teg teg cgt ett gat aac gte att 144 
Pro Leu Gin Lys. Met Glu Lys Leu Ser Ser Arg Leu Asp Asn Val lie 
35 40 45 

ctg gtg aag cgc gaa gat cgc cag cea gtg cac age ttt aag ctg cgc 192 
Leu Val Lys Arg Glu Asp Arg Gin Pro Val His Ser Phe Lys Leu Arg 
50 55 60 

gge gca tac gee atg atg gcg gge ctg acg gaa gaa cag aaa gcg cac 240 
Gly Ala Tyr Ala Met Met Ala Gly Leu Thr Glu Glu Gin Lys Ala His 
65 70 75 . 80 

gge gtg ate act get tet gcg ggt aac cac gcg cag gge gte gcg ttt 288 
Gly Val lie Thr Ala Ser Ala Gly Asn His Ala Gin Gly Val Ala Phe 
85 90 95 

tet tet gcg egg tta gge gtg aag gee ctg ate gtt atg cea ace gee 33 6 

Ser Ser Ala Arg Leu Gly Val Lys Ala Leu lie Val Met Pro Thr Ala 
100 105 110 

ace gee gae ate aaa gte gac gcg gtg cgc gge ttc gge gge gaa gtg 384 
Thr Ala Asp lie Lys Val Asp Ala Val Arg Gly Phe Gly Gly Glu Val 

115 120 125 

ctg etc cac gge gcg aac ttt gat gaa gcg aaa gee aaa gcg ate gaa 432 
Leu Leu His Gly Ala Asn Phe Asp Glu Ala Lys Ala Lys Ala lie Glu 
130 135 140 

ctg tea cag cag cag ggg ttc ace tgg gtg ccg ccg ttc gae eat ccg 480 
Leu Ser Gin Gin Gin Gly Phe Thr Trp Val Pro Pro Phe Asp His Pro 
145 150 155 160 



^ Substitute Specification 



atg gtg att gcc ggg caa ggc acg ctg gcg ctg gaa ctg etc cag cag 528 
Met Val lie Ala Gly Gin Gly Thr Leu Ala Leu Glu Leu Leu Gin Gin 

165 170 175 

5 

gac gcc cat etc gae egc gta ttt gtg cea gtc ggc ggc ggc ggt ctg 576 
Asp Ala His Leu Asp Arg Val Phe Val Pro Val Gly Gly Gly Gly Leu 
180 185 190 

10 get get ggc gtg gcg gtg ctg ate aaa caa ctg atg eeg caa ate aaa 624 
Ala Ala Gly Val Ala Val Leu lie Lys Gin Leu Met Pro Gin lie Lys 
195 200 205 

gtg ate gee gta gaa gcg gaa gac tee gee tge ctg aaa gea gcg ctg 672 
15 Val lie Ala Val Glu Ala Glu Asp Ser Ala Cys Leu Lys Ala Ala Leu 
210 215 220 

gat gcg ggt cat ccg gtt gat ctg eeg egc gta ggg eta ttt get gaa 720 
Asp Ala Gly His Pro Val Asp Leu Pro Arg Val Gly Leu Phe Ala Glu 
20 225 230 235 240 

ggc gta gcg gta aaa egc ate ggt gac gaa ace tte cgt tta tge cag 768 
Gly Val Ala Val Lys Arg lie Gly Asp Glu Thr Phe Arg Leu Cys Gin 
245 250 255 

25 

gag tat etc gae gae ate ate ace gtc gat age gat gcg ate tgt gcg 816 
Glu Tyr Leu Asp Asp lie lie Thr Val Asp Ser Asp Ala lie Cys Ala 
260 265 270 

30 gcg atg aag gat tta tte gaa gat gtg egc gcg gtg gcg gaa ecc tct 864 
Ala Met Lys Asp Leu Phe Glu Asp Val Arg Ala Val Ala Glu Pro Ser 
275 280 285 

ggc gcg ctg gcg ctg gcg gga atg aaa aaa tat ate gcc ctg cae aac 912 
35 Gly Ala Leu Ala Leu Ala Gly Met Lys Lys Tyr lie Ala Leu His Asn 
290 295 300 

att egc ggc gaa egg ctg gcg eat att ett tec ggt gee aac gtg aac 960 
lie Arg Gly Glu Arg Leu Ala His lie Leu Ser Gly Ala Asn Val Asn 
40 305 310 315 320 

tte cae ggc ctg egc tac gtc tea gaa egc tge gaa ctg ggc gaa cag 1008 
Phe His Gly Leu Arg Tyr Val Ser Glu Arg Cys Glu Leu Gly Glu Gin 
325 330 335 



45 



cgt gaa gcg ttg ttg gcg gtg ace att eeg gaa gaa aaa ggc age tte 1056 
Arg Glu Ala Leu Leu Ala Val Thr lie Pro Glu Glu Lys Gly Ser Phe 
340 345 350 



50 etc aaa tte tge caa ctg ett ggc ggg cgt teg gtc ace gag tte aac 1104 
Leu Lys Phe Cys Gin Leu Leu Gly Gly Arg Ser Val Thr Glu Phe Asn 
355 360 365 

tac cgt ttt gee gat gcc aaa aac gcc tge ate ttt gtc ggt gtg egc 1152 
55 Tyr Arg Phe Ala Asp Ala Lys Asn Ala Cys lie Phe Val Gly Val Arg 
370 375 380 

ctg age egc ggc etc gaa gag egc aaa gaa att ttg cag atg etc aac 1200 
Leu Ser Arg Gly Leu Glu Glu Arg Lys Glu lie Leu Gin Met Leu Asn 
60 385 390 395 400 



Substitute Speci£ication 



7 



gac ggc ggc tac age gtg gtt gat etc tec gac gac gaa atg gcg aag 1248 
Asp Gly Gly Tyr Ser Val Val Asp Leu Ser Asp Asp Glu Met Ala Lys 
405 410 415 

5 eta cac gtg cgc tat atg gtc ggc gga cgt cca teg cat ccg ttg cag 1296 
Leu His Val Arg Tyr Met Val Gly Gly Arg Pro Ser His Pro Leu Gin 
420 425 430 

gaa cgc etc tac age ttc gaa ttc ccg gaa tea ccg ggc gcg etg etg 1344 
10 Glu Arg Leu Tyr Ser Phe Glu Phe Pro Glu Ser Pro Gly Ala Leu Leii 
435 440 445 

cgc ttc etc aac acg etg ggt acg tac tgg aac att tet ttg ttc cac 1392 
Arg Phe Leu Asn Thr Leu Gly Thr Tyr Trp Asn lie Ser Leu Phe His 
15 450 455 460 

tat cgc age eat ggc ace gac tac ggg cgc gta etg gcg gcg ttc gaa 1440 
Tyr Arg Ser His Gly Thr Asp Tyr Gly Arg Val Leu Ala Ala Phe Glu 
465 470 475 480 



20 



ett ggc gac cat gaa ccg gat ttc gaa acc egg etg aat gag etg ggc 1488 
Leu Gly Asp His Glu Pro Asp Phe Glu Thr Arg Leu Asn Glu Leu Gly 
485 490 495 



25 tac gat tgc cac gac gaa ace aat aac ccg gcg ttc agg ttc ttt ttg 1536 
Tyr Asp Cys His Asp Glu Thr Asn Asn Pro Ala Phe Arg Phe Phe Leu 
500 505 510 

gcg ggt tag 1545 
30 Ala Gly 



<210> 6 

<211> 514 

35 <212> PRT 

<213> Escherichia coli 



<400> 6 

40 Met Ala Asp Ser Gin Pro Leu Ser Gly Ala Pro Glu Gly Ala Glu Tyr 
1 5 10 15 



45 



60 



Leu Arg Ala Val Leu Arg Ala Pro Val Tyr Glu Ala Ala Gin Val Thr 

20 25 30 

Pro Leu Gin Lys Met Glu Lys Leu Ser Ser Arg Leu Asp Asn Val lie 
35 40 45 



Leu Val Lys Arg Glu Asp Arg Gin Pro Val His Ser Phe Lys Leu Arg 
50 50 55 60 

Gly Ala Tyr Ala Met Met Ala Gly Leu Thr Glu Glu Gin Lys Ala His 
65 70 75 80 

55 Gly Val lie Thr Ala Ser Ala Gly Asn His Ala Gin Gly Val Ala Phe 

85 90 95 



Ser Ser Ala Arg Leu Gly Val Lys Ala Leu lie Val Met Pro Thr Ala 
100 105 110 

Thr Ala Asp lie Lys Val Asp Ala Val Arg Gly Phe Gly Gly Glu Val 
115 120 125 



Substitute Specification 



10 



25 



40 



55 



8 



Leu Leu His Gly Ala Asn Phe Asp Glu Ala Lys Ala Lys Ala lie Glu 
130 135 140 

Leu Ser Gin Gin Gin Gly Phe Thr Trp Val Pro Pro Phe Asp His Pro 
145 150 155 160 

Met Val lie Ala Gly Gin Gly Thr Leu Ala Leu Glu Leu Leu Gin Gin 
165 170 175 

Asp Ala His Leu Asp Arg Val Phe Val Pro Val Gly Gly Gly Gly Leu 
180 185 190 



Ala Ala Gly Val Ala Val Leu lie Lys Gin Leu Met Pro Gin lie Lys 
15 195 200 205 

Val lie Ala Val Glu Ala Glu Asp Ser Ala Cys Leu Lys Ala Ala Leu 
210 215 220 

20 Asp Ala Gly His Pro Val Asp Leu Pro Arg Val Gly Leu Phe Ala Glu 
225 230 235 240 



Gly Val Ala Val Lys Arg lie Gly Asp Glu Thr Phe Arg Leu Cys Gin 
245 250 255 

Glu Tyr Leu Asp Asp lie lie Thr Val Asp Ser Asp Ala lie Cys Ala 
260 265 270 



Ala Met Lys Asp Leu Phe Glu Asp Val Arg Ala Val Ala Glu Pro Ser 
30 275 280 285 

Gly Ala Leu Ala Leu Ala Gly Met Lys Lys Tyr lie Ala Leu His Asn 
290 295 300 

35 lie Arg Gly Glu Arg Leu Ala His He Leu Ser Gly Ala Asn Val Asn 
305 310 315 320 



Phe His Gly Leu Arg Tyr Val Ser Glu Arg Cys Glu Leu Gly Glu Gin 
325 330 335 

Arg Glu Ala Leu Leu Ala Val Thr He Pro Glu Glu Lys Gly Ser Phe 
340 345 350 



Leu Lys Phe Cys Gin Leu Leu Gly Gly Arg Ser Val Thr Glu Phe Asn 
45 355 360 365 

Tyr Arg Phe Ala Asp Ala Lys Asn Ala Cys He Phe Val Gly Val Arg 
370 375 380 

50 Leu Ser Arg Gly Leu Glu Glu Arg Lys Glu He Leu Gin Met Leu Asn 
385 . 390 395 400 



Asp Gly Gly Tyr Ser Val Val Asp Leu Ser Asp Asp Glu Met Ala Lys 
405 410 415 

Leu His Val Arg Tyr Met Val Gly Gly Arg Pro Ser His Pro Leu Gin 
420 425 430 



Glu Arg Leu Tyr Ser Phe Glu Phe Pro Glu Ser Pro Gly Ala Leu Leu 
60 435 440 445 



Substitute Specification 
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Arg Phe Leu Asn Thr Leu Gly Thr 
450 455 

Tyr Arg Ser His Gly Thr Asp Tyr 
465 470 

Leu Gly Asp His Glu Pro Asp Phe 
485 

Tyr Asp Cys His Asp Glu Thr Asn 
500 

Ala Gly 



Tyr Trp Asn lie Ser Leu Phe His 
460 

Gly Arg Val Leu Ala Ala Phe Glu 
475 480 

Glu Thr Arg Leu Asn Glu Leu Gly 
490 495 

Asn Pro Ala Phe Arg Phe Phe Leu 
505 510 



<210> 7 

<211> 1545 

<212> DNA 

<213> Escherichia coli 



<220> 

<221> CDS 

<222> (1)..(1542) 

<223> ilvA allele 



<220> 

<221> mutation 
<222> (856) . . (856) 
<223> 



<400> 7 

atg get gac teg caa ccc ctg tec ggt get ecg gaa ggt gee gaa tat 48 

Met Ala Asp Ser Gin Pro Leu Ser Gly Ala Pro Glu Gly Ala Glu Tyr 
1.5 10 15 

tta aga gca gtg ctg cgc gcg ccg gtt tac gag gcg gcg cag gtt acg 96 

Leu Arg Ala Val Leu Arg Ala Pro Val Tyr Glu Ala Ala Gin Val Thr 

20 25 30 

ccg eta caa aaa atg gaa aaa ctg teg teg egt ett gat aac gtc att 144 

Pro Leu Gin Lys Met Glu Lys Leu Ser Ser Arg Leu Asp Asn Val lie 
35 40 45 

ctg gtg aag cgc gaa gat cgc cag cca gtg cac age ttt aag ctg cgc 192 

Leu Val Lys Arg Glu Asp Arg Gin Pro Val His Ser Phe Lys Leu Arg 
50 55 60 

gge gca tac gee atg atg gcg ggc ctg acg gaa gaa cag aaa gcg cac 240 

Gly Ala Tyr Ala Met Met Ala Gly Leu Thr Glu Glu Gin Lys Ala His 
65 70 75 80 

ggc gtg ate act get tct gcg ggt aac cac gcg cag ggc gtc gcg ttt 288 

Gly Val lie Thr Ala Ser Ala Gly Asn His Ala Gin Gly Val Ala Phe 
85 90 95 



tct tct gcg egg tta gge gtg aag gee ctg ate gtt atg cca ace gee 
Ser Ser Ala Arg Leu Gly Val Lys Ala Leu lie Val Met Pro Thr Ala 
100 105 110 



336 



Substitute Specification 
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acc gcc gac ate aaa gtc gac gcg gtg cgc ggc ttc ggc ggc gaa gtg 384 
Thr Ala Asp lie Lys Val Asp Ala Val Arg Gly Phe Gly Gly Glu Val 
115 120 125 

ctg etc cae ggc gcg aac ttt gat gaa gcg aaa gcc aaa gcg ate gaa 432 
Leu Leu His Gly Ala Asn Phe Asp Glu Ala Lys Ala Lys Ala lie Glu 
130 135 140 

ctg tea cag cag cag ggg tte aee tgg gtg eeg eeg tte gae eat eeg 480 
Leu Ser Gin Gin Gin Gly Phe Thr Trp Val Pro Pro Phe Asp His Pro 
145 150 155 160 

atg gtg att gee ggg caa ggc acg ctg gcg ctg gaa ctg etc cag cag 528 
Met Val lie Ala Gly Gin Gly Thr Leu Ala Leu Glu Leu Leu Gin Gin 
165 170 175 

gae gcc eat etc gae ege gta ttt gtg eea gtc ggc ggc ggc ggt etg 576 
Asp Ala His Leu Asp Arg Val Phe Val Pro Val Gly Gly Gly Gly Leu 
180 185 190 

get get ggc gtg gcg gtg ctg ate aaa caa ctg atg ccg caa ate aaa 624 
Ala Ala Gly Val Ala Val Leu lie Lys Gin Leu Met Pro Gin lie Lys 
195 200 205 

gtg ate gee gta gaa gcg gaa gae tec gee tge etg aaa gea gcg etg 672 
Val He Ala Val Glu Ala Glu Asp Ser Ala Cys Leu Lys Ala Ala Leu 
210 215 220 

gat gcg ggt cat ccg gtt gat ctg eeg cgc gta ggg eta ttt get gaa 720 
Asp Ala Gly His Pro Val Asp Leu Pro Arg Val Gly Leu Phe Ala Glu 
225 230 235 240 

ggc gta gcg gta aaa ege ate ggt gae gaa aee tte cgt tta tge cag 768 
Gly Val Ala Val Lys Arg He Gly Asp Glu Thr Phe Arg Leu Cys Gin 
245 250 255 

gag tat etc gac gac ate ate acc gtc gat age gat gcg ate tgt gcg 816 
Glu Tyr Leu Asp Asp He He Thr Val Asp Ser Asp Ala He Cys Ala 
260 265 270 

gcg atg aag gat tta tte gaa gat gtg ege gcg gtg gcg aaa ece tet 864 
Ala Met Lys Asp Leu Phe Glu Asp Val Arg Ala Val Ala Lys Pro Ser 
275 280 285 

ggc gcg etg gcg etg gcg gga atg aaa aaa tat ate gee etg cae aac 912 
Gly Ala Leu Ala Leu Ala Gly Met Lys Lys Tyr He Ala Leu His Asn 

290 295 300 

att cgc ggc gaa egg etg gcg eat att ett tee ggt gee aac gtg aac 960 
He Arg Gly Glu Arg Leu Ala His He Leu Ser Gly Ala Asn Val Asn 
305 310 315 320 

ttc eac ggc ctg cgc tac gtc tea gaa cgc tge gaa ctg ggc gaa cag 1008 
Phe His Gly Leu Arg Tyr Val Ser Glu Arg Cys Glu Leu Gly Glu Gin 
325 330 335 

cgt gaa gcg ttg ttg gcg gtg acc att eeg gaa gaa aaa ggc age tte 1056 
Arg Glu Ala Leu Leu Ala Val Thr He Pro Glu Glu Lys Gly Ser Phe 
340 345 350 



Sxibstltute Specification 
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etc aaa ttc tgc caa ctg ctt ggc ggg cgt teg gte ace gag ttc aac 1104 
Leu Lys Phe Cys Gin Leu Leu Gly Gly Arg Ser Val Thr Glu Phe Asn 
355 360 365 

tac cgt ttt gcc gat gcc aaa aac gcc tgc ate ttt gtc ggt gtg cgc 1152 
Tyr Arg Phe Ala Asp Ala Lys Asn Ala Cys lie Phe Val Gly Val Arg 
370 375 380 

etg age ege ggc etc gaa gag cgc aaa gaa att ttg eag atg etc aac 1200 
Leu Ser Arg Gly Leu Glu Glu Arg Lys Glu lie Leu Gin Met Leu Asn 
385 390 395 400 

gae ggc ggc tac age gtg gtt gat etc tec gac gac gaa atg gcg aag 1248 
Asp Gly Gly Tyr Ser Val Val Asp Leu Ser Asp Asp Glu Met Ala Lys 
405 410 415 

eta eac gtg ege tat atg gtc ggc gga cgt eca teg cat eeg ttg eag 1296 
Leu His Val Arg Tyr Met Val Gly Gly Arg Pro Ser His Pro Leu Gin 
420 425 430 

gaa cgc etc tac age ttc gaa ttc ccg gaa tea ccg ggc gcg ctg ctg 1344 
Glu Arg Leu Tyr Ser Phe Glu Phe Pro Glu Ser Pro Gly Ala Leu Leu 
435 440 445 

ege ttc etc aac aeg ctg ggt aeg tac tgg aac att tet ttg ttc cae 1392 
Arg Phe Leu Asn Thr Leu Gly Thr Tyr Trp Asn lie Ser Leu Phe His 
450 455 460 

tat cgc age eat ggc acc gac tac ggg cgc gta ctg gcg gcg ttc gaa 1440 
Tyr Arg Ser His Gly Thr Asp Tyr Gly Arg Val Leu Ala Ala Phe Glu 
465 470 475 480 

ctt ggc gac cat gaa ccg gat ttc gaa ace egg etg aat gag etg ggc 1488 
Leu Gly Asp His Glu Pro Asp Phe Glu Thr Arg Leu Asn Glu Leu Gly 
485 490 495 

tac gat tgc cae gac gaa acc aat aac ccg gcg ttc agg ttc ttt ttg 1536 
Tyr Asp Cys His Asp Glu Thr Asn Asn Pro Ala Phe Arg Phe Phe Leu 
500 505 510 

gcg ggt tag 1545 
Ala Gly 



<210> 8 

<211> 514 

<212> PRT 

<213> Escherichia eoli 



<400> 8 

Met Ala Asp Ser Gin Pro Leu Ser Gly Ala Pro Glu Gly Ala Glu Tyr 
15 10 15 

Leu Arg Ala Val Leu Arg Ala Pro Val Tyr Glu Ala Ala Gin Val . Thr 
20 25 30 

Pro Leu Gin Lys Met Glu Lys Leu Ser Ser Arg Leu Asp Asn Val lie 
35 40 45 



Leu Val Lys Arg Glu Asp Arg Gin Pro Val His Ser Phe Lys Leu Arg 
50 55 60 



Substitute Specification 



10 



25 



35 



50 



12 



Gly Ala Tyr Ala Met Met Ala Gly Leu Thr Glu Glu Gin Lys Ala His 
65 70 75 80 

Gly Val lie Thr Ala Ser Ala Gly Asn His Ala Gin Gly Val Ala Phe 
85 90 95 

Ser Ser Ala Arg Leu Gly Val Lys Ala Leu lie Val Met Pro Thr Ala 
100 105 110 

Thr Ala Asp He Lys Val Asp Ala Val Arg Gly Phe Gly Gly Glu Val 
115 120 125 



Leu Leu His Gly Ala Asn Phe Asp Glu Ala Lys Ala Lys Ala He Glu 
15 130 135 140 

Leu Ser Gin Gin Gin Gly Phe Thr Trp Val Pro Pro Phe Asp His Pro 
145 150 155 160 

20 Met Val He Ala Gly Gin Gly Thr Leu Ala Leu Glu Leu Leu Gin Gin 

165 170 175 



Asp Ala His Leu Asp Arg Val Phe Val Pro Val Gly Gly Gly Gly Leu 
180 185 190 



Ala Ala Gly Val Ala Val Leu He Lys Gin Leu Met Pro Gin He Lys 

195 200 205 

30 Val He Ala Val Glu Ala Glu Asp Ser Ala Cys Leu Lys Ala Ala Leu 

210 215 220 



Asp Ala Gly His Pro Val Asp Leu Pro Arg Val Gly Leu Phe Ala Glu 

225 230 235 240 

Gly Val Ala Val Lys Arg He Gly Asp Glu Thr Phe Arg Leu Cys Gin 

245 250 255 



Glu Tyr Leu Asp Asp He He Thr Val Asp Ser Asp Ala He Cys Ala 
40 260 265 270 

Ala Met Lys Asp Leu Phe Glu Asp Val Arg Ala Val Ala Lys Pro Ser 
275 280 285 

45 Gly Ala Leu Ala Leu Ala Gly Met Lys Lys Tyr He Ala Leu His Asn 
290 295 300 



He Arg Gly Glu Arg Leu Ala His He Leu Ser Gly Ala Asn Val Asn 

305 310 315 320 

Phe His Gly Leu Arg Tyr Val Ser Glu Arg Cys Glu Leu Gly Glu Gin 

325 330 335 



Arg Glu Ala Leu Leu Ala Val Thr He Pro Glu Glu Lys Gly Ser Phe 
55 340 345 350 

Leu Lys Phe Cys Gin Leu Leu Gly Gly Arg Ser Val Thr Glu Phe Asn 
355 360 365 

60 Tyr Arg Phe Ala Asp Ala Lys Asn Ala Cys He Phe Val Gly Val Arg 
370 375 380 



Sxibstitute Specification 
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Leu Ser Arg Gly Leu Glu Glu Arg Lys Glu lie Leu Gin Met Leu Asn 
385 390 395 400 

Asp Gly Gly Tyr Ser Val Val Asp Leu Ser Asp Asp Glu Met Ala Lys 
5 405 410 415 

Leu His Val Arg Tyr Met Val Gly Gly Arg Pro Ser His Pro Leu Gin 
420 425 430 

10 Glu Arg Leu Tyr Ser Phe Glu Phe Pro Glu Ser Pro Gly Ala Leu Leu 
435 440 445 



15 



50 



55 



60 



Arg Phe Leu Asn Thr Leu Gly Thr Tyr Trp Asn lie Ser Leu Phe His 
450 455 460 

Tyr Arg Ser His Gly Thr Asp Tyr Gly Arg Val Leu Ala Ala Phe Glu 
465 470 475 480 



Leu Gly Asp His Glu Pro Asp Phe Glu Thr Arg Leu Asn Glu Leu Gly 
20 485 490 495 

Tyr Asp Cys His Asp Glu Thr Asn Asn Pro Ala Phe Arg Phe Phe Leu 
500 505 510 

25 Ala Gly 

<210> 9 

<211> 1548 

30 <212> DNA 

<213> Escherichia coli 



<220> 

35 <221> DNA 

<222> (1)..(1548) 
<223> 



40 <220> 

<221> CDS 

<222> (527) . . (952) 

<223> yjgF orf 

45 



<400> 9 
tcgcgatctg 


gtactgtaag 


gggaaataga 


gatgacacac 


gataataaat 


tgcaggttga 


60 


agctattaaa 


cgcggcacgg 


taattgacca 


tatccccgcc 


cagatcggtt 


ttaagctgtt 


120 


gagtctgttc 


aagctgaccg 


aaacggatca 


gcgcatcacc 


attggtctga 


acctgccttc 


180 


tggcgagatg 


ggccgcaaag 


atctgatcaa 


aatcgaaaat 


acctttttga 


gtgaagatca 


240 


agtagatcaa 


ctggcattgt 


atgcgccgca 


agccacggtt 


aaccgtatcg 


acaactatga 


300 


agtggtgggt 


aaatcgcgcc 


caagtctgcc 


ggagcgcatc 


gacaatgtgc 


tggtctgccc 


360 


gaacagcaac 


tgtatcagcc 


atgccgaacc 


ggtttcatcc 


agctttgccg 


tgcgaaaacg 


420 


cgccaatgat 


atcgcgctca 


aatgcaaata 


ctgtgaaaaa 


gagttttccc 


ataatgtggt 


480 



Substitute Specification 
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gctggccaat taattgcggt tggtaataaa agtctggctc cctata atg age cag 535 

Met Ser Gin 
1 

5 act ttt tac cgc tgt aat aaa gga gaa ate atg age aaa act ate geg 583 
Thr Phe Tyr Arg Cys Asn Lys Gly Glu lie Met Ser Lys Thr lie Ala 
5 10 15 

acg gaa aat gca ccg gca get ate ggt eet tae gta eag gge gtt gat 631 
10 Thr Glu Asn Ala Pro Ala Ala lie Gly Pro Tyr Val Gin Gly Val Asp 
20 25 30 35 

etg gge aat atg ate ate aee tee ggt eag ate eeg gta aat eeg aaa 679 
Leu Gly Asn Met lie lie Thr Ser Gly Gin lie Pro Val Asn Pro Lys 
15 40 45 50 

acg gge gaa gta eeg gca gae gtc get gca eag gca cgt eag teg etg 727 
Thr Gly Glu Val Pro Ala Asp Val Ala Ala Gin Ala Arg Gin Ser Leu 
55 60 65 



20 

gat aae gta aaa gcg ate gtc gaa gcc get gge etg aaa gtg gge gae 775 
Asp Asn Val Lys Ala lie Val Glu Ala Ala Gly Leu Lys Val Gly Asp 
70 75 80 

25 ate gtt aaa act ace gtg ttt gta aaa gat etg aae gae tte gca acc 823 
lie Val Lys Thr Thr Val Phe Val Lys Asp Leu Asn Asp Phe Ala Thr 
85 90 95 

gta aae gee act tae gaa gee tte tte ace gaa cac aae gee aee tte 871 
30 Val Asn Ala Thr Tyr Glu Ala Phe Phe Thr Glu His Asn Ala Thr Phe 
100 105 110 115 

eeg gca cgt tet tge gtt gaa gtt gee cgt etg eeg aaa gae gtg aag 919 
Pro Ala Arg Ser Cys Val Glu Val Ala Arg Leu Pro Lys Asp Val Lys 
35 120 125 130 

att gag ate gaa gcg ate get gtt cgt cgc taa tettgatgga aatecgggct 972 
lie Glu lie Glu Ala lie Ala Val Arg Arg 
135 140 

40 

ateatgcccg gattaagtct gatgacaaae geaaaatege etgatgeget acgettatea 1032 

ggectaegtg atteetgeaa tttattgaat ttgttggeeg gataaggeat ttaegecgea 1092 

45 teeggcatga acaaaactea ctttgtctac aatctgaatc ggggctateg tgcccagttt 1152 

attetttatt gecagcegta aegaeggcta tagaaccett tcaccaaetg ggttaatgtc 1212 

atataeectg ecagaatege aaecagecae gggaaatage ttaaeggeag egeetgtaat 1272 

50 

tgeagataae tggeeagcgg tgaaaaegge aatgegatee egaeaateat caegateaeg 1332 

gteatgatca ttaaeggeea egatgeaeag etetgaataa aeggeaeaeg gegggtgcgg 1392 

55 ateatatgca caatcagcgt ttgegacagt aageceaeea eaaaeeatee egaetggaac 1452 

agegtttgcg tttceggegt gttggeatgg aataeeeace acateaggea aaaegtcaaa 1512 

atatcgaaga tegagctgat eggteegaag aagate 1548 

60 



Substitute Specification 



15 



10 



25 



<210> 10 

<211> 141 

<212> PRT 

<213> Escherichia coli 



<400> 10 

Met Ser Gin Thr Phe Tyr Arg Cys Asn Lys Gly Glu He Met Ser Lys 
1 5 10 15 

Thr He Ala Thr Glu Asn Ala Pro Ala Ala He Gly Pro Tyr Val Gin 
20 25 30 



Gly Val Asp Leu Gly Asn Met He He Thr Ser Gly Gin He Pro Val 
15 35 40 45 

Asn Pro Lys Thr Gly Glu Val Pro Ala Asp Val Ala Ala Gin Ala Arg 
50 55 60 

20 Gin Ser Leu Asp Asn Val Lys Ala He Val Glu Ala Ala Gly Leu Lys 
65 70 75 80 



Val Gly Asp He Val Lys Thr Thr Val Phe Val Lys Asp Leu Asn Asp 
85 90 95 

Phe Ala Thr Val Asn Ala Thr Tyr Glu Ala Phe Phe Thr Glu His Asn 
100 105 110 



Ala Thr Phe Pro Ala Arg Ser Cys Val Glu Val Ala Arg Leu Pro Lys 
30 115 120 125 

Asp Val Lys He Glu He Glu Ala He Ala Val Arg Arg 

130 135 140 



